Tiered Clustering to Improve Lexical Entailment
نویسنده
چکیده
Many tasks in Natural Language Processing involve recognizing lexical entailment. Two different approaches to this problem have been proposed recently that are quite different from each other. The first is an asymmetric similarity measure designed to give high scores when the contexts of the narrower term in the entailment are a subset of those of the broader term. The second is a supervised approach where a classifier is learned to predict entailment given a concatenated latent vector representation of the word. Both of these approaches are vector space models that use a single context vector as a representation of the word. In this work, I study the effects of clustering words into senses and using these multiple context vectors to infer entailment using extensions of these two algorithms. I find that this approach offers some improvement to these entailment algorithms.
منابع مشابه
A Mixture Model with Sharing for Lexical Semantics
We introduce tiered clustering, a mixture model capable of accounting for varying degrees of shared (context-independent) feature structure, and demonstrate its applicability to inferring distributed representations of word meaning. Common tasks in lexical semantics such as word relatedness or selectional preference can benefit from modeling such structure: Polysemous word usage is often govern...
متن کاملLearning Parse-Free Event-Based Features for Textual Entailment Recognition
We propose new parse-free event-based features to be used in conjunction with lexical, syntactic, and semantic features of texts and hypotheses for Machine Learning-based Recognizing Textual Entailment. Our new similarity features are extracted without using shallow semantic parsers, but still lexical and compositional semantics are not left out. Our experimental results demonstrate that these ...
متن کاملWHU at TAC 2009: A Tri-categorization Approach to Textual Entailment Recognition
This paper describes our system of recognizing textual entailment for RTE-5 challenge at TAC 2009. We propose a textual entailment recognition framework and implement a system of classification which takes lexical, syntactic and semantic features as considered. To improve the performance, some lexical-semantic resources and web knowledge bases are also incorporated in the system. Official resul...
متن کاملThe Distributional Inclusion Hypotheses and Lexical Entailment
This paper suggests refinements for the Distributional Similarity Hypothesis. Our proposed hypotheses relate the distributional behavior of pairs of words to lexical entailment – a tighter notion of semantic similarity that is required by many NLP applications. To automatically explore the validity of the defined hypotheses we developed an inclusion testing algorithm for characteristic features...
متن کاملLarge-Scale Acquisition of Entailment Pattern Pairs by Exploiting Transitivity
We propose a novel method for acquiring entailment pairs of binary patterns on a large-scale. This method exploits the transitivity of entailment and a self-training scheme to improve the performance of an already strong supervised classifier for entailment, and unlike previous methods that exploit transitivity, it works on a largescale. With it we acquired 138.1 million pattern pairs with 70% ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1412.0751 شماره
صفحات -
تاریخ انتشار 2013